Assessing thesaurus-based annotations for semantic search applications

نویسندگان

  • Kai Eckert
  • Magnus Pfeffer
  • Heiner Stuckenschmidt
چکیده

Statistical methods for automated document indexing are becoming an alternative to the manual assignment of keywords. We argue that the quality of the thesaurus used as a basis for indexing in regard to its ability to adequately cover the contents to be indexed and as a basis for the specific indexingmethod used is of crucial importance in automatic indexing.We present an interactive tool for thesaurus evaluation that is basedona combinationof statisticalmeasures and appropriate visualisation techniques that supports the detection of potential problems in a thesaurus. We describe the methods used and show that the tool supports the detection and correction of errors, leading to a better indexing result.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Finnish National Ontologies for the Semantic Web - Towards a Content and Service Infrastructure

We present a national ontology development and service framework being developed in Finland in 2003-2007. The framework is based on a set of related core ontologies, most notably on a national upper ontology based on the commonly used Finnish General Thesaurus YSA maintained by the National Library of Finland. The framework implements three ontology services by a web-based system ONKI. Firstly,...

متن کامل

Query Architecture Expansion in Web Using Fuzzy Multi Domain Ontology

Due to the increasing web, there are many challenges to establish a general framework for data mining and retrieving structured data from the Web. Creating an ontology is a step towards solving this problem. The ontology raises the main entity and the concept of any data in data mining. In this paper, we tried to propose a method for applying the "meaning" of the search system, But the problem ...

متن کامل

Semantic Notation and Retrieval in Art and Architecture Image Collections

In this paper, we analyze various methods used for semantic annotation and search in a collection of art and architecture images. We discuss the Art and Architecture Thesaurus, WordNet, ULAN and Iconclass ontology. Systems for searching and retrieval art and architecture image collections are presented. We explore if the MPEG 7 descriptors are useful for art and architecture image annotations. ...

متن کامل

Creating a National Content and Service Infrastructure for the Finnish Semantic Web

We present a national ontology development and service framework being developed in Finland in 2003-2007. Our goal is to initiate and support collaborative ontology development processes of various expert groups now developing keyword thesauri. The framework is based on a set of related core ontologies, most notably on a national upper ontology based on the commonly used Finnish General Thesaur...

متن کامل

Enhancing Web Search with Heterogeneous Semantic Knowledge

This paper explores four kinds of semantic knowledge to improve keyword-based Web search, including thesauruses, categories, ontologies, and social annotations. These heterogeneous semantic knowledge represent meanings of Web information, thus they can be used to improve search results in respect of semantic relevance. Currently, different semantic search paradigms have been developed for diffe...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • IJMSO

دوره 3  شماره 

صفحات  -

تاریخ انتشار 2008